Dataset info
| Number of variables | 57 |
|---|---|
| Number of observations | 28235 |
| Missing cells | 448424 (27.9%) |
| Duplicate rows | 0 (0.0%) |
| Total size in memory | 9.5 MiB |
| Average record size in memory | 352.0 B |
Variables types
| Numeric | 8 |
|---|---|
| Categorical | 5 |
| Boolean | 0 |
| Date | 0 |
| URL | 0 |
| Text (Unique) | 0 |
| Rejected | 44 |
| Unsupported | 0 |
Warnings
city_levenshtein_simple has 18290 (64.8%) missing values | Missing |
city_levenshtein_simple_bin is highly correlated with city_levenshtein_simple (ρ = 0.9850689705) | Rejected |
city_levenshtein_term is highly correlated with city_levenshtein_simple_bin (ρ = 0.9488635718) | Rejected |
city_levenshtein_term_bin is highly correlated with city_levenshtein_term (ρ = 0.9930704216) | Rejected |
city_trigram_simple is highly correlated with city_levenshtein_term_bin (ρ = 0.9252824659) | Rejected |
city_trigram_simple_bin is highly correlated with city_trigram_simple (ρ = 0.9937484473) | Rejected |
city_trigram_term is highly correlated with city_trigram_simple_bin (ρ = 0.9305819585) | Rejected |
city_trigram_term_bin is highly correlated with city_trigram_term (ρ = 0.9967797356) | Rejected |
fax_levenshtein has 27505 (97.4%) missing values | Missing |
fax_levenshtein_bin is highly correlated with fax_levenshtein (ρ = 0.988964125) | Rejected |
fax_trigram is highly correlated with fax_levenshtein_bin (ρ = 0.9516868396) | Rejected |
fax_trigram_bin is highly correlated with fax_trigram (ρ = 0.9925748954) | Rejected |
id has a high cardinality: 27821 distinct values | Warning |
name_levenshtein_simple_bin is highly correlated with name_levenshtein_simple (ρ = 0.9735343526) | Rejected |
name_levenshtein_term_bin is highly correlated with name_levenshtein_term (ρ = 0.9805968951) | Rejected |
name_trigram_simple is highly correlated with name_levenshtein_simple_bin (ρ = 0.9636431124) | Rejected |
name_trigram_simple_bin is highly correlated with name_trigram_simple (ρ = 0.9845669582) | Rejected |
name_trigram_term is highly correlated with name_trigram_simple_bin (ρ = 0.9446397504) | Rejected |
name_trigram_term_bin is highly correlated with name_trigram_term (ρ = 0.9865189142) | Rejected |
phone_levenshtein has 16371 (58.0%) missing values | Missing |
phone_levenshtein_bin is highly correlated with phone_levenshtein (ρ = 0.9901385307) | Rejected |
phone_trigram is highly correlated with phone_levenshtein_bin (ρ = 0.9515029939) | Rejected |
phone_trigram_bin is highly correlated with phone_trigram (ρ = 0.9915532963) | Rejected |
street_levenshtein_simple has 19997 (70.8%) missing values | Missing |
street_levenshtein_simple_bin is highly correlated with street_levenshtein_simple (ρ = 0.977604212) | Rejected |
street_levenshtein_term is highly correlated with street_levenshtein_simple_bin (ρ = 0.9253512513) | Rejected |
street_levenshtein_term_bin is highly correlated with street_levenshtein_term (ρ = 0.9816441933) | Rejected |
street_number_levenshtein is highly correlated with street_number_equality (ρ = 0.9128880932) | Rejected |
street_number_levenshtein_bin is highly correlated with street_number_levenshtein (ρ = 0.9916964742) | Rejected |
street_number_trigram is highly correlated with street_number_levenshtein_bin (ρ = 0.941348484) | Rejected |
street_number_trigram_bin is highly correlated with street_number_trigram (ρ = 0.996232939) | Rejected |
street_trigram_simple is highly correlated with street_levenshtein_term_bin (ρ = 0.931397809) | Rejected |
street_trigram_simple_bin is highly correlated with street_trigram_simple (ρ = 0.9874532905) | Rejected |
street_trigram_term is highly correlated with street_trigram_simple_bin (ρ = 0.9655146219) | Rejected |
street_trigram_term_bin is highly correlated with street_trigram_term (ρ = 0.9886679289) | Rejected |
website_levenshtein_simple has 26416 (93.6%) missing values | Missing |
website_levenshtein_simple_bin is highly correlated with website_levenshtein_simple (ρ = 0.9724771988) | Rejected |
website_levenshtein_term is highly correlated with website_levenshtein_simple_bin (ρ = 0.9296893974) | Rejected |
website_levenshtein_term_bin is highly correlated with website_levenshtein_term (ρ = 0.9853659607) | Rejected |
website_trigram_simple is highly correlated with website_levenshtein_term_bin (ρ = 0.9424270326) | Rejected |
website_trigram_simple_bin is highly correlated with website_trigram_simple (ρ = 0.9746094625) | Rejected |
website_trigram_term is highly correlated with website_trigram_simple_bin (ρ = 0.9245284537) | Rejected |
website_trigram_term_bin is highly correlated with website_trigram_term (ρ = 0.9853001957) | Rejected |
zip_levenshtein_simple has 299 (1.1%) zeros | Zeros |
zip_levenshtein_simple has 20539 (72.7%) missing values | Missing |
zip_levenshtein_simple_bin is highly correlated with zip_levenshtein_simple (ρ = 0.9674268288) | Rejected |
zip_levenshtein_term is highly correlated with zip_levenshtein_simple_bin (ρ = 0.9590540554) | Rejected |
zip_levenshtein_term_bin is highly correlated with zip_levenshtein_term (ρ = 0.9684604865) | Rejected |
zip_trigram_simple is highly correlated with zip_levenshtein_term (ρ = 0.9081318961) | Rejected |
zip_trigram_simple_bin is highly correlated with zip_trigram_simple (ρ = 0.9967649793) | Rejected |
zip_trigram_term is highly correlated with zip_trigram_simple_bin (ρ = 0.9924286597) | Rejected |
zip_trigram_term_bin is highly correlated with zip_trigram_term (ρ = 0.9967692543) | Rejected |
city_levenshtein_simple
Numeric
| Distinct count | 218 |
|---|---|
| Unique (%) | 0.8% |
| Missing (%) | 64.8% |
| Missing (n) | 18290 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.8824213147 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros (%) | 0.5% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.1793719977 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.2514716387 |
|---|---|
| Coef of variation | 0.284979105 |
| Kurtosis | 3.665280581 |
| Mean | 0.8824213147 |
| MAD | 0.1819805503 |
| Skewness | -2.17834568 |
| Sum | 8775.679688 |
| Variance | 0.06323798746 |
| Memory size | 110.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 7685 | 27.2% | |
| 0.6666669846 | 770 | 2.7% | |
| 0.5714290142 | 211 | 0.7% | |
| 0.8333330154 | 134 | 0.5% | |
| 0 | 133 | 0.5% | |
| 0.1000000015 | 54 | 0.2% | |
| 0.5 | 44 | 0.2% | |
| 0.1666669995 | 44 | 0.2% | |
| 0.125 | 37 | 0.1% | |
| 0.25 | 36 | 0.1% | |
| Other values (207) | 797 | 2.8% | |
| (Missing) | 18290 | 64.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 133 | 0.5% | |
| 0.05000000075 | 1 | < 0.1% | |
| 0.05555560067 | 3 | < 0.1% | |
| 0.05714289844 | 1 | < 0.1% | |
| 0.06060609967 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1 | 7685 | 27.2% | |
| 0.962962985 | 2 | < 0.1% | |
| 0.9583330154 | 4 | < 0.1% | |
| 0.9523810148 | 2 | < 0.1% | |
| 0.9444440007 | 3 | < 0.1% |
city_levenshtein_simple_bin
Highly correlated
This variable is highly correlated with city_levenshtein_simple and should be ignored for analysis
| Correlation | 0.9850689705 |
|---|
city_levenshtein_term
Highly correlated
This variable is highly correlated with city_levenshtein_simple_bin and should be ignored for analysis
| Correlation | 0.9488635718 |
|---|
city_levenshtein_term_bin
Highly correlated
This variable is highly correlated with city_levenshtein_term and should be ignored for analysis
| Correlation | 0.9930704216 |
|---|
city_trigram_simple
Highly correlated
This variable is highly correlated with city_levenshtein_term_bin and should be ignored for analysis
| Correlation | 0.9252824659 |
|---|
city_trigram_simple_bin
Highly correlated
This variable is highly correlated with city_trigram_simple and should be ignored for analysis
| Correlation | 0.9937484473 |
|---|
city_trigram_term
Highly correlated
This variable is highly correlated with city_trigram_simple_bin and should be ignored for analysis
| Correlation | 0.9305819585 |
|---|
city_trigram_term_bin
Highly correlated
This variable is highly correlated with city_trigram_term and should be ignored for analysis
| Correlation | 0.9967797356 |
|---|
fax_equality
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 0 | |
|---|---|
| 2 | 386 |
| 1 | 344 |
| Value | Count | Frequency (%) | |
| 0 | 27505 | 97.4% | |
| 2 | 386 | 1.4% | |
| 1 | 344 | 1.2% |
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Contains chars | False |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | False |
fax_levenshtein
Numeric
| Distinct count | 19 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 97.4% |
| Missing (n) | 27505 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.7646326423 |
|---|---|
| Minimum | 0.1000000015 |
| Maximum | 1 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 0.1000000015 |
|---|---|
| 5-th percentile | 0.3000000119 |
| Q1 | 0.5 |
| Median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 0.8999999762 |
| Interquartile range | 0.5 |
Descriptive statistics
| Standard deviation | 0.2792602479 |
|---|---|
| Coef of variation | 0.3652214706 |
| Kurtosis | -1.050391316 |
| Mean | 0.7646326423 |
| MAD | 0.2558975518 |
| Skewness | -0.6480944753 |
| Sum | 558.1818237 |
| Variance | 0.07798628509 |
| Memory size | 110.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 386 | 1.4% | |
| 0.5 | 44 | 0.2% | |
| 0.400000006 | 41 | 0.1% | |
| 0.6000000238 | 38 | 0.1% | |
| 0.5454545617 | 37 | 0.1% | |
| 0.4545454681 | 30 | 0.1% | |
| 0.3000000119 | 29 | 0.1% | |
| 0.6999999881 | 22 | 0.1% | |
| 0.7272727489 | 21 | 0.1% | |
| 0.200000003 | 17 | 0.1% | |
| Other values (8) | 65 | 0.2% | |
| (Missing) | 27505 | 97.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0.1000000015 | 12 | < 0.1% | |
| 0.200000003 | 17 | 0.1% | |
| 0.2727272809 | 5 | < 0.1% | |
| 0.3000000119 | 29 | 0.1% | |
| 0.3636363745 | 15 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1 | 386 | 1.4% | |
| 0.9090909362 | 13 | < 0.1% | |
| 0.8999999762 | 3 | < 0.1% | |
| 0.8181818128 | 3 | < 0.1% | |
| 0.8000000119 | 3 | < 0.1% |
fax_levenshtein_bin
Highly correlated
This variable is highly correlated with fax_levenshtein and should be ignored for analysis
| Correlation | 0.988964125 |
|---|
fax_trigram
Highly correlated
This variable is highly correlated with fax_levenshtein_bin and should be ignored for analysis
| Correlation | 0.9516868396 |
|---|
fax_trigram_bin
Highly correlated
This variable is highly correlated with fax_trigram and should be ignored for analysis
| Correlation | 0.9925748954 |
|---|
id
Categorical
| Distinct count | 27821 |
|---|---|
| Unique (%) | 98.5% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 11287#11288 | 2 |
|---|---|
| 12999#13000 | 2 |
| 12367#12368 | 2 |
| Other values (27818) |
| Value | Count | Frequency (%) | |
| 11287#11288 | 2 | < 0.1% | |
| 12999#13000 | 2 | < 0.1% | |
| 12367#12368 | 2 | < 0.1% | |
| 12425#12426 | 2 | < 0.1% | |
| 12569#12570 | 2 | < 0.1% | |
| 12018#12019 | 2 | < 0.1% | |
| 12575#12576 | 2 | < 0.1% | |
| 13021#13022 | 2 | < 0.1% | |
| 12710#12711 | 2 | < 0.1% | |
| 12450#12451 | 2 | < 0.1% | |
| Other values (27811) | 28215 | 99.9% |
| Max length | 13 |
|---|---|
| Mean length | 9.077173721 |
| Min length | 3 |
| Contains chars | False |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
is_match
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 1 | |
|---|---|
| -1 |
| Value | Count | Frequency (%) | |
| 1 | 20262 | 71.8% | |
| -1 | 7973 | 28.2% |
| Max length | 2 |
|---|---|
| Mean length | 1.282380025 |
| Min length | 1 |
| Contains chars | False |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
name_levenshtein_simple
Numeric
| Distinct count | 3511 |
|---|---|
| Unique (%) | 12.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.6281707883 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros (%) | 0.8% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.1357139945 |
| Q1 | 0.3633864969 |
| Median | 0.6666669846 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range | 0.6366135031 |
Descriptive statistics
| Standard deviation | 0.3047668338 |
|---|---|
| Coef of variation | 0.4851655662 |
| Kurtosis | -1.23851192 |
| Mean | 0.6281707883 |
| MAD | 0.2647316456 |
| Skewness | -0.2540036738 |
| Sum | 17736.40234 |
| Variance | 0.09288282692 |
| Memory size | 110.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 7152 | 25.3% | |
| 0.6666669846 | 2281 | 8.1% | |
| 0.8000000119 | 1598 | 5.7% | |
| 0.8571429849 | 1028 | 3.6% | |
| 0.5 | 957 | 3.4% | |
| 0.5714290142 | 600 | 2.1% | |
| 0.75 | 533 | 1.9% | |
| 0.400000006 | 472 | 1.7% | |
| 0.8888890147 | 438 | 1.6% | |
| 0.3333329856 | 296 | 1.0% | |
| Other values (3501) | 12880 | 45.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 212 | 0.8% | |
| 0.02597399987 | 1 | < 0.1% | |
| 0.02857140079 | 1 | < 0.1% | |
| 0.03333330154 | 3 | < 0.1% | |
| 0.03571429849 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1 | 7152 | 25.3% | |
| 0.9772729874 | 7 | < 0.1% | |
| 0.9750000238 | 16 | 0.1% | |
| 0.9714289904 | 2 | < 0.1% | |
| 0.9666669965 | 2 | < 0.1% |
name_levenshtein_simple_bin
Highly correlated
This variable is highly correlated with name_levenshtein_simple and should be ignored for analysis
| Correlation | 0.9735343526 |
|---|
name_levenshtein_term
Numeric
| Distinct count | 727 |
|---|---|
| Unique (%) | 2.6% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.5370063186 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros (%) | 0.9% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.1111110002 |
| Q1 | 0.243242994 |
| Median | 0.4761900008 |
| Q3 | 0.875 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range | 0.631757006 |
Descriptive statistics
| Standard deviation | 0.3248197734 |
|---|---|
| Coef of variation | 0.6048713923 |
| Kurtosis | -1.379292369 |
| Mean | 0.5370063186 |
| MAD | 0.287050277 |
| Skewness | 0.2472924888 |
| Sum | 15162.37402 |
| Variance | 0.1055078804 |
| Memory size | 110.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 6569 | 23.3% | |
| 0.25 | 528 | 1.9% | |
| 0.5 | 487 | 1.7% | |
| 0.200000003 | 464 | 1.6% | |
| 0.3333329856 | 444 | 1.6% | |
| 0.6666669846 | 330 | 1.2% | |
| 0.5714290142 | 311 | 1.1% | |
| 0.1428570002 | 284 | 1.0% | |
| 0.2142859995 | 278 | 1.0% | |
| 0.1666669995 | 278 | 1.0% | |
| Other values (717) | 18262 | 64.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 255 | 0.9% | |
| 0.02439020015 | 1 | < 0.1% | |
| 0.02597399987 | 1 | < 0.1% | |
| 0.02857140079 | 1 | < 0.1% | |
| 0.02941180021 | 6 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1 | 6569 | 23.3% | |
| 0.9787229896 | 2 | < 0.1% | |
| 0.9750000238 | 9 | < 0.1% | |
| 0.9705880284 | 1 | < 0.1% | |
| 0.9666669965 | 1 | < 0.1% |
name_levenshtein_term_bin
Highly correlated
This variable is highly correlated with name_levenshtein_term and should be ignored for analysis
| Correlation | 0.9805968951 |
|---|
name_trigram_simple
Highly correlated
This variable is highly correlated with name_levenshtein_simple_bin and should be ignored for analysis
| Correlation | 0.9636431124 |
|---|
name_trigram_simple_bin
Highly correlated
This variable is highly correlated with name_trigram_simple and should be ignored for analysis
| Correlation | 0.9845669582 |
|---|
name_trigram_term
Highly correlated
This variable is highly correlated with name_trigram_simple_bin and should be ignored for analysis
| Correlation | 0.9446397504 |
|---|
name_trigram_term_bin
Highly correlated
This variable is highly correlated with name_trigram_term and should be ignored for analysis
| Correlation | 0.9865189142 |
|---|
phone_equality
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 0 | |
|---|---|
| 2 | |
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 16369 | 58.0% | |
| 2 | 7863 | 27.8% | |
| 1 | 4003 | 14.2% |
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Contains chars | False |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | False |
phone_levenshtein
Numeric
| Distinct count | 29 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 58.0% |
| Missing (n) | 16371 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.8439607024 |
|---|---|
| Minimum | 0.1000000015 |
| Maximum | 1 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 0.1000000015 |
|---|---|
| 5-th percentile | 0.2727272809 |
| Q1 | 0.6999999881 |
| Median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 0.8999999762 |
| Interquartile range | 0.3000000119 |
Descriptive statistics
| Standard deviation | 0.2519796193 |
|---|---|
| Coef of variation | 0.2985679507 |
| Kurtosis | 0.4257274568 |
| Mean | 0.8439607024 |
| MAD | 0.2102880329 |
| Skewness | -1.351725817 |
| Sum | 10012.75 |
| Variance | 0.06349372119 |
| Memory size | 110.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 7863 | 27.8% | |
| 0.5454545617 | 409 | 1.4% | |
| 0.4545454681 | 408 | 1.4% | |
| 0.8181818128 | 348 | 1.2% | |
| 0.6363636255 | 332 | 1.2% | |
| 0.3636363745 | 273 | 1.0% | |
| 0.9090909362 | 251 | 0.9% | |
| 0.5 | 249 | 0.9% | |
| 0.6000000238 | 230 | 0.8% | |
| 0.7272727489 | 196 | 0.7% | |
| Other values (18) | 1305 | 4.6% | |
| (Missing) | 16371 | 58.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0.1000000015 | 89 | 0.3% | |
| 0.1666666716 | 9 | < 0.1% | |
| 0.1818181872 | 146 | 0.5% | |
| 0.200000003 | 154 | 0.5% | |
| 0.25 | 17 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1 | 7863 | 27.8% | |
| 0.9166666865 | 20 | 0.1% | |
| 0.9090909362 | 251 | 0.9% | |
| 0.8999999762 | 48 | 0.2% | |
| 0.8333333135 | 24 | 0.1% |
phone_levenshtein_bin
Highly correlated
This variable is highly correlated with phone_levenshtein and should be ignored for analysis
| Correlation | 0.9901385307 |
|---|
phone_trigram
Highly correlated
This variable is highly correlated with phone_levenshtein_bin and should be ignored for analysis
| Correlation | 0.9515029939 |
|---|
phone_trigram_bin
Highly correlated
This variable is highly correlated with phone_trigram and should be ignored for analysis
| Correlation | 0.9915532963 |
|---|
street_levenshtein_simple
Numeric
| Distinct count | 1717 |
|---|---|
| Unique (%) | 6.1% |
| Missing (%) | 70.8% |
| Missing (n) | 19997 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.6922866106 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros (%) | 0.1% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.1666669995 |
| Q1 | 0.3857277632 |
| Median | 0.8000000119 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range | 0.6142722368 |
Descriptive statistics
| Standard deviation | 0.3100984395 |
|---|---|
| Coef of variation | 0.4479336143 |
| Kurtosis | -1.263106465 |
| Mean | 0.6922866106 |
| MAD | 0.2781056166 |
| Skewness | -0.5044587851 |
| Sum | 5703.057129 |
| Variance | 0.09616104513 |
| Memory size | 110.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 2845 | 10.1% | |
| 0.8571429849 | 358 | 1.3% | |
| 0.6666669846 | 220 | 0.8% | |
| 0.75 | 210 | 0.7% | |
| 0.8000000119 | 203 | 0.7% | |
| 0.8888890147 | 145 | 0.5% | |
| 0.8333330154 | 79 | 0.3% | |
| 0.5 | 73 | 0.3% | |
| 0.3333329856 | 62 | 0.2% | |
| 0.875 | 53 | 0.2% | |
| Other values (1706) | 3990 | 14.1% | |
| (Missing) | 19997 | 70.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 22 | 0.1% | |
| 0.02777780034 | 1 | < 0.1% | |
| 0.02857140079 | 1 | < 0.1% | |
| 0.03333330154 | 2 | < 0.1% | |
| 0.03846149892 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1 | 2845 | 10.1% | |
| 0.9841269851 | 10 | < 0.1% | |
| 0.9818180203 | 4 | < 0.1% | |
| 0.9814810157 | 1 | < 0.1% | |
| 0.9807689786 | 1 | < 0.1% |
street_levenshtein_simple_bin
Highly correlated
This variable is highly correlated with street_levenshtein_simple and should be ignored for analysis
| Correlation | 0.977604212 |
|---|
street_levenshtein_term
Highly correlated
This variable is highly correlated with street_levenshtein_simple_bin and should be ignored for analysis
| Correlation | 0.9253512513 |
|---|
street_levenshtein_term_bin
Highly correlated
This variable is highly correlated with street_levenshtein_term and should be ignored for analysis
| Correlation | 0.9816441933 |
|---|
street_number_equality
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 2 | |
|---|---|
| 1 | |
| 0 | 762 |
| Value | Count | Frequency (%) | |
| 2 | 14064 | 49.8% | |
| 1 | 13409 | 47.5% | |
| 0 | 762 | 2.7% |
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Contains chars | False |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | False |
street_number_levenshtein
Highly correlated
This variable is highly correlated with street_number_equality and should be ignored for analysis
| Correlation | 0.9128880932 |
|---|
street_number_levenshtein_bin
Highly correlated
This variable is highly correlated with street_number_levenshtein and should be ignored for analysis
| Correlation | 0.9916964742 |
|---|
street_number_trigram
Highly correlated
This variable is highly correlated with street_number_levenshtein_bin and should be ignored for analysis
| Correlation | 0.941348484 |
|---|
street_number_trigram_bin
Highly correlated
This variable is highly correlated with street_number_trigram and should be ignored for analysis
| Correlation | 0.996232939 |
|---|
street_trigram_simple
Highly correlated
This variable is highly correlated with street_levenshtein_term_bin and should be ignored for analysis
| Correlation | 0.931397809 |
|---|
street_trigram_simple_bin
Highly correlated
This variable is highly correlated with street_trigram_simple and should be ignored for analysis
| Correlation | 0.9874532905 |
|---|
street_trigram_term
Highly correlated
This variable is highly correlated with street_trigram_simple_bin and should be ignored for analysis
| Correlation | 0.9655146219 |
|---|
street_trigram_term_bin
Highly correlated
This variable is highly correlated with street_trigram_term and should be ignored for analysis
| Correlation | 0.9886679289 |
|---|
website_levenshtein_simple
Numeric
| Distinct count | 313 |
|---|---|
| Unique (%) | 1.1% |
| Missing (%) | 93.6% |
| Missing (n) | 26416 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.8158383965 |
|---|---|
| Minimum | 0.174999997 |
| Maximum | 1 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 0.174999997 |
|---|---|
| 5-th percentile | 0.4113099873 |
| Q1 | 0.6153849959 |
| Median | 0.9736840129 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 0.8249999881 |
| Interquartile range | 0.3846150041 |
Descriptive statistics
| Standard deviation | 0.2207053304 |
|---|---|
| Coef of variation | 0.2705257833 |
| Kurtosis | -0.8028762341 |
| Mean | 0.8158383965 |
| MAD | 0.1942105144 |
| Skewness | -0.7789297104 |
| Sum | 1484.01001 |
| Variance | 0.04871084541 |
| Memory size | 110.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 906 | 3.2% | |
| 0.8571429849 | 38 | 0.1% | |
| 0.8888890147 | 31 | 0.1% | |
| 0.8000000119 | 30 | 0.1% | |
| 0.75 | 29 | 0.1% | |
| 0.5 | 22 | 0.1% | |
| 0.7142860293 | 21 | 0.1% | |
| 0.8412700295 | 20 | 0.1% | |
| 0.7272729874 | 19 | 0.1% | |
| 0.5066670179 | 18 | 0.1% | |
| Other values (302) | 685 | 2.4% | |
| (Missing) | 26416 | 93.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0.174999997 | 2 | < 0.1% | |
| 0.2166340053 | 1 | < 0.1% | |
| 0.2407049984 | 1 | < 0.1% | |
| 0.2592230141 | 2 | < 0.1% | |
| 0.2638890147 | 7 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1 | 906 | 3.2% | |
| 0.9791669846 | 1 | < 0.1% | |
| 0.9772729874 | 1 | < 0.1% | |
| 0.9736840129 | 2 | < 0.1% | |
| 0.9666669965 | 6 | < 0.1% |
website_levenshtein_simple_bin
Highly correlated
This variable is highly correlated with website_levenshtein_simple and should be ignored for analysis
| Correlation | 0.9724771988 |
|---|
website_levenshtein_term
Highly correlated
This variable is highly correlated with website_levenshtein_simple_bin and should be ignored for analysis
| Correlation | 0.9296893974 |
|---|
website_levenshtein_term_bin
Highly correlated
This variable is highly correlated with website_levenshtein_term and should be ignored for analysis
| Correlation | 0.9853659607 |
|---|
website_trigram_simple
Highly correlated
This variable is highly correlated with website_levenshtein_term_bin and should be ignored for analysis
| Correlation | 0.9424270326 |
|---|
website_trigram_simple_bin
Highly correlated
This variable is highly correlated with website_trigram_simple and should be ignored for analysis
| Correlation | 0.9746094625 |
|---|
website_trigram_term
Highly correlated
This variable is highly correlated with website_trigram_simple_bin and should be ignored for analysis
| Correlation | 0.9245284537 |
|---|
website_trigram_term_bin
Highly correlated
This variable is highly correlated with website_trigram_term and should be ignored for analysis
| Correlation | 0.9853001957 |
|---|
zip_levenshtein_simple
Numeric
| Distinct count | 20 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 72.7% |
| Missing (n) | 20539 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.8938370347 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros (%) | 1.1% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.200000003 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.2479177564 |
|---|---|
| Coef of variation | 0.2773634791 |
| Kurtosis | 5.726059437 |
| Mean | 0.8938370347 |
| MAD | 0.1646519005 |
| Skewness | -2.591394424 |
| Sum | 6878.969727 |
| Variance | 0.06146321446 |
| Memory size | 110.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 5968 | 21.1% | |
| 0.8000000119 | 737 | 2.6% | |
| 0 | 299 | 1.1% | |
| 0.200000003 | 144 | 0.5% | |
| 0.6666669846 | 138 | 0.5% | |
| 0.25 | 110 | 0.4% | |
| 0.6000000238 | 109 | 0.4% | |
| 0.5 | 52 | 0.2% | |
| 0.8333330154 | 47 | 0.2% | |
| 0.400000006 | 42 | 0.1% | |
| Other values (9) | 50 | 0.2% | |
| (Missing) | 20539 | 72.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 299 | 1.1% | |
| 0.08333329856 | 4 | < 0.1% | |
| 0.1666669995 | 5 | < 0.1% | |
| 0.200000003 | 144 | 0.5% | |
| 0.2222220004 | 4 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1 | 5968 | 21.1% | |
| 0.8333330154 | 47 | 0.2% | |
| 0.8000000119 | 737 | 2.6% | |
| 0.75 | 24 | 0.1% | |
| 0.6666669846 | 138 | 0.5% |
zip_levenshtein_simple_bin
Highly correlated
This variable is highly correlated with zip_levenshtein_simple and should be ignored for analysis
| Correlation | 0.9674268288 |
|---|
zip_levenshtein_term
Highly correlated
This variable is highly correlated with zip_levenshtein_simple_bin and should be ignored for analysis
| Correlation | 0.9590540554 |
|---|
zip_levenshtein_term_bin
Highly correlated
This variable is highly correlated with zip_levenshtein_term and should be ignored for analysis
| Correlation | 0.9684604865 |
|---|
zip_trigram_simple
Highly correlated
This variable is highly correlated with zip_levenshtein_term and should be ignored for analysis
| Correlation | 0.9081318961 |
|---|
zip_trigram_simple_bin
Highly correlated
This variable is highly correlated with zip_trigram_simple and should be ignored for analysis
| Correlation | 0.9967649793 |
|---|
zip_trigram_term
Highly correlated
This variable is highly correlated with zip_trigram_simple_bin and should be ignored for analysis
| Correlation | 0.9924286597 |
|---|
zip_trigram_term_bin
Highly correlated
This variable is highly correlated with zip_trigram_term and should be ignored for analysis
| Correlation | 0.9967692543 |
|---|
First rows
| city_levenshtein_simple | city_levenshtein_simple_bin | city_levenshtein_term | city_levenshtein_term_bin | city_trigram_simple | city_trigram_simple_bin | city_trigram_term | city_trigram_term_bin | fax_equality | fax_levenshtein | fax_levenshtein_bin | fax_trigram | fax_trigram_bin | id | is_match | name_levenshtein_simple | name_levenshtein_simple_bin | name_levenshtein_term | name_levenshtein_term_bin | name_trigram_simple | name_trigram_simple_bin | name_trigram_term | name_trigram_term_bin | phone_equality | phone_levenshtein | phone_levenshtein_bin | phone_trigram | phone_trigram_bin | street_levenshtein_simple | street_levenshtein_simple_bin | street_levenshtein_term | street_levenshtein_term_bin | street_number_equality | street_number_levenshtein | street_number_levenshtein_bin | street_number_trigram | street_number_trigram_bin | street_trigram_simple | street_trigram_simple_bin | street_trigram_term | street_trigram_term_bin | website_levenshtein_simple | website_levenshtein_simple_bin | website_levenshtein_term | website_levenshtein_term_bin | website_trigram_simple | website_trigram_simple_bin | website_trigram_term | website_trigram_term_bin | zip_levenshtein_simple | zip_levenshtein_simple_bin | zip_levenshtein_term | zip_levenshtein_term_bin | zip_trigram_simple | zip_trigram_simple_bin | zip_trigram_term | zip_trigram_term_bin | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 0 | NaN | -1 | NaN | -1 | 1204#1207 | 1 | 0.666667 | 3 | 0.400000 | 2 | 0.666667 | 3 | 0.526316 | 2 | 0 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 2 | 1.0 | 4 | 1.000000 | 4 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 |
| 1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 0 | NaN | -1 | NaN | -1 | 1272#1279 | 1 | 0.666667 | 3 | 0.411765 | 2 | 0.666667 | 3 | 0.444444 | 2 | 0 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 2 | 1.0 | 4 | 1.000000 | 4 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 |
| 2 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 0 | NaN | -1 | NaN | -1 | 6258#6259 | 1 | 1.000000 | 4 | 1.000000 | 4 | 1.000000 | 4 | 1.000000 | 4 | 2 | 1.0 | 4 | 1.0 | 4 | 1.000000 | 4 | 1.000000 | 4 | 2 | 1.0 | 4 | 1.000000 | 4 | 1.000000 | 4 | 1.00 | 4 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 1.0 | 4 | 1.0 | 4 | 1.000000 | 4 | 1.000000 | 4 |
| 3 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 0 | NaN | -1 | NaN | -1 | 16076#16077 | -1 | 0.565476 | 2 | 0.260870 | 1 | 0.333333 | 1 | 0.189189 | 0 | 2 | 1.0 | 4 | 1.0 | 4 | NaN | -1 | NaN | -1 | 1 | 0.0 | 0 | 0.000000 | 0 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 |
| 4 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 0 | NaN | -1 | NaN | -1 | 2666#2671 | 1 | 0.666667 | 3 | 0.500000 | 2 | 0.666667 | 3 | 0.518519 | 2 | 2 | 1.0 | 4 | 1.0 | 4 | 1.000000 | 4 | 1.000000 | 4 | 2 | 1.0 | 4 | 1.000000 | 4 | 1.000000 | 4 | 1.00 | 4 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 1.0 | 4 | 1.0 | 4 | 1.000000 | 4 | 1.000000 | 4 |
| 5 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 0 | NaN | -1 | NaN | -1 | 4402#4403 | 1 | 1.000000 | 4 | 1.000000 | 4 | 1.000000 | 4 | 1.000000 | 4 | 2 | 1.0 | 4 | 1.0 | 4 | 1.000000 | 4 | 1.000000 | 4 | 2 | 1.0 | 4 | 1.000000 | 4 | 1.000000 | 4 | 1.00 | 4 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 1.000000 | 4 | 1.000000 | 4 |
| 6 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 0 | NaN | -1 | NaN | -1 | 4025#4028 | -1 | 0.121032 | 0 | 0.185185 | 0 | 0.000000 | 0 | 0.000000 | 0 | 0 | NaN | -1 | NaN | -1 | 0.380952 | 1 | 0.428571 | 2 | 1 | 0.5 | 2 | 0.250000 | 1 | 0.333333 | 1 | 0.16 | 0 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 0.8 | 4 | 0.8 | 4 | 0.333333 | 1 | 0.333333 | 1 |
| 7 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 0 | NaN | -1 | NaN | -1 | 4126#4138 | -1 | 0.355556 | 1 | 0.333333 | 1 | 0.250000 | 1 | 0.159091 | 0 | 0 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 1 | 0.6 | 3 | 0.222222 | 1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 |
| 8 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 0 | NaN | -1 | NaN | -1 | 4559#4560 | -1 | 0.167298 | 0 | 0.173913 | 0 | 0.000000 | 0 | 0.000000 | 0 | 0 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 |
| 9 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 0 | NaN | -1 | NaN | -1 | 15610#15611 | -1 | 0.208333 | 1 | 0.266667 | 1 | 0.000000 | 0 | 0.000000 | 0 | 0 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 |
Last rows
| city_levenshtein_simple | city_levenshtein_simple_bin | city_levenshtein_term | city_levenshtein_term_bin | city_trigram_simple | city_trigram_simple_bin | city_trigram_term | city_trigram_term_bin | fax_equality | fax_levenshtein | fax_levenshtein_bin | fax_trigram | fax_trigram_bin | id | is_match | name_levenshtein_simple | name_levenshtein_simple_bin | name_levenshtein_term | name_levenshtein_term_bin | name_trigram_simple | name_trigram_simple_bin | name_trigram_term | name_trigram_term_bin | phone_equality | phone_levenshtein | phone_levenshtein_bin | phone_trigram | phone_trigram_bin | street_levenshtein_simple | street_levenshtein_simple_bin | street_levenshtein_term | street_levenshtein_term_bin | street_number_equality | street_number_levenshtein | street_number_levenshtein_bin | street_number_trigram | street_number_trigram_bin | street_trigram_simple | street_trigram_simple_bin | street_trigram_term | street_trigram_term_bin | website_levenshtein_simple | website_levenshtein_simple_bin | website_levenshtein_term | website_levenshtein_term_bin | website_trigram_simple | website_trigram_simple_bin | website_trigram_term | website_trigram_term_bin | zip_levenshtein_simple | zip_levenshtein_simple_bin | zip_levenshtein_term | zip_levenshtein_term_bin | zip_trigram_simple | zip_trigram_simple_bin | zip_trigram_term | zip_trigram_term_bin | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 28225 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 0 | NaN | -1 | NaN | -1 | 2157#2168 | 1 | 0.857143 | 4 | 0.565217 | 2 | 0.857143 | 4 | 0.636364 | 3 | 0 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 2 | 1.000000 | 4 | 1.000000 | 4 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 |
| 28226 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 0 | NaN | -1 | NaN | -1 | 2875#2882 | 1 | 0.400000 | 2 | 0.133333 | 0 | 0.400000 | 2 | 0.238095 | 1 | 2 | 1.0 | 4 | 1.0 | 4 | NaN | -1 | NaN | -1 | 2 | 1.000000 | 4 | 1.000000 | 4 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 |
| 28227 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 0 | NaN | -1 | NaN | -1 | 2640#2644 | 1 | 0.285714 | 1 | 0.096774 | 0 | 0.285714 | 1 | 0.129032 | 0 | 0 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 2 | 1.000000 | 4 | 1.000000 | 4 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 |
| 28228 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 0 | NaN | -1 | NaN | -1 | 1400#1396 | 1 | 0.666667 | 3 | 0.454545 | 2 | 0.666667 | 3 | 0.478261 | 2 | 0 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 |
| 28229 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 0 | NaN | -1 | NaN | -1 | 2018#2024 | 1 | 0.888889 | 4 | 0.806452 | 4 | 0.888889 | 4 | 0.806452 | 4 | 0 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 2 | 1.000000 | 4 | 1.000000 | 4 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 |
| 28230 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 0 | NaN | -1 | NaN | -1 | 7148#7149 | 1 | 1.000000 | 4 | 1.000000 | 4 | 1.000000 | 4 | 1.000000 | 4 | 2 | 1.0 | 4 | 1.0 | 4 | NaN | -1 | NaN | -1 | 2 | 1.000000 | 4 | 1.000000 | 4 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 |
| 28231 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 0 | NaN | -1 | NaN | -1 | 8851#8860 | 1 | 0.727273 | 3 | 0.620690 | 3 | 0.727273 | 3 | 0.628571 | 3 | 0 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 |
| 28232 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 0 | NaN | -1 | NaN | -1 | 6468#6470 | -1 | 0.263889 | 1 | 0.384615 | 1 | 0.066667 | 0 | 0.040000 | 0 | 2 | 1.0 | 4 | 1.0 | 4 | 1.000000 | 4 | 1.00 | 4 | 2 | 1.000000 | 4 | 1.000000 | 4 | 1.00 | 4 | 1.0000 | 4 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 0.0 | 0 | 0.0 | 0 | 0.0 | 0 | 0.0 | 0 |
| 28233 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 0 | NaN | -1 | NaN | -1 | 3516#3527 | -1 | 0.208571 | 1 | 0.187500 | 0 | 0.040000 | 0 | 0.041667 | 0 | 0 | NaN | -1 | NaN | -1 | 0.497633 | 2 | 0.45 | 2 | 1 | 0.666667 | 3 | 0.333333 | 1 | 0.35 | 1 | 0.1875 | 0 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 | 1.0 | 4 |
| 28234 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 0 | NaN | -1 | NaN | -1 | 12899#12477 | -1 | 0.527778 | 2 | 0.380952 | 1 | 0.347826 | 1 | 0.194444 | 0 | 0 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | 1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 | NaN | -1 |